Manuscripts in Time and Space: Experiments in Scriptometrics on an Old French Corpus

نویسنده

  • Jean-Baptiste Camps
چکیده

Witnesses of medieval literary texts, preserved in manuscript, are layered objects, being almost exclusively copies of copies. This results in multiple and hard to distinguish linguistic strata – the author’s scripta interacting with the scriptae of the various scribes – in a context where literary written language is already a dialectal hybrid. Moreover, no single linguistic phenomenon allows to distinguish between different scriptae, and only the combination of multiple characteristics is likely to be significant [9] – but which ones? The most common approach is to search for these features in a set of previously selected texts, that are supposed to be representative of a given scripta. This can induce a circularity, in which texts are used to select features that in turn characterise them as belonging to a linguistic area. To counter this issue, this paper offers an unsupervised and corpus-based approach, in which clustering methods are applied to an Old French corpus to identify main divisions and groups. Ultimately, scriptometric profiles are built for each of them.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

P-69: Expression of Leptin Receptor mRNA in Ovine Corpus Luteum

Background: Many hormones are involved in the regulation of reproduction. Leptin hormone which is mainly secreted by adipose tissue plays an important role in energy homeostasis and reproduction. It seems that leptin is an important linkage between body metabolism and reproductive system. Moreover, it has been shown that leptin and leptin receptor express in reproductive organs of some species....

متن کامل

Ultrastructural Changes of Corpus Luteum after Ovarian Stimulation at Implantation Period

Background: To achieve multiple oocytes for in vitro fertilization, ovulation induction is induced by gonadotropins however, it has several effects on oocytes and embryo quality and endometrium receptivity. The aim of this study was to assess ultrastructural changes of corpus luteum after ovarian induction using human menopausal gonadotropin (HMG) and human chorionic gonadotropin (HCG) during l...

متن کامل

Corpus based coreference resolution for Farsi text

"Coreference resolution" or "finding all expressions that refer to the same entity" in a text, is one of the important requirements in natural language processing. Two words are coreference when both refer to a single entity in the text or the real world. So the main task of coreference resolution systems is to identify terms that refer to a unique entity. A coreference resolution tool could be...

متن کامل

در کاربرد تشخیص زبان گفتاری GMM-VSM در قالب سیستم GMM

GMM is one of the most successful models in the field of automatic language identification. In this paper we have proposed a new model named adapted weight GMM (AW-GMM). This model is similar to GMM but the weights are determined using GMM-VSM LID system based on the power of each component in discriminating one language from the others. Also considering the computational complexity of GMM-VSM,...

متن کامل

بررسی اثرات عصاره الکلی گیاه Ruta graveolens بر عملکرد سیستم تولید مثل موشهای ماده نابالغ نژاد Balb/c

    Background & Aim: Ruta graveolens(R.G) is currently used by Middle East countries for its antispasmodic, diuretic and sedative effects. Based on recent experiments R.G has antifertility activity in mice when administrated orally. This work was undertaken to examine the possible effect of alcoholic extract of R.G on reproductive system in female immature mice. Material and Methods: In this e...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1802.01429  شماره 

صفحات  -

تاریخ انتشار 2018